CVPR 2016: Las Vegas, NV, USA
2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-8851-1
Oral & Spotlight Session 1-1A
O1-1A: Image Captioning and Question Answering
Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko, Trevor Darrell:
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. 1-10
Junhua Mao, Jonathan Huang, Alexander Toshev, Oana Camburu, Alan L. Yuille, Kevin Murphy:
Generation and Comprehension of Unambiguous Object Descriptions. 11-20
Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, Alexander J. Smola:
Stacked Attention Networks for Image Question Answering. 21-29
Hyeonwoo Noh, Paul Hongsuck Seo, Bohyung Han:
Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction. 30-38
S1-1A: Language and Vision
Scott E. Reed, Zeynep Akata, Honglak Lee, Bernt Schiele:
Learning Deep Representations of Fine-Grained Visual Descriptions. 49-58
Zeynep Akata, Mateusz Malinowski, Mario Fritz, Bernt Schiele:
Multi-cue Zero-Shot Learning with Strong Supervision. 59-68
Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh N. Nguyen, Matthias Hein, Bernt Schiele:
Latent Embeddings for Zero-Shot Classification. 69-77
Roland Kwitt, Sebastian Hegenbart, Marc Niethammer:
One-Shot Learning of Scene Locations via Feature Trajectory Transfer. 78-86
Chuang Gan, Tianbao Yang, Boqing Gong:
Learning Attributes Equals Multi-Source Domain Generalization. 87-97
Carl Vondrick, Hamed Pirsiavash, Antonio Torralba:
Anticipating Visual Representations from Unlabeled Video. 98-106
Oral & Spotlight Session 1-1B
O1-1B: Matching and Alignment
Kwang Moo Yi, Yannick Verdie, Pascal Fua, Vincent Lepetit:
Learning to Assign Orientations to Feature Points. 107-116
Tinghui Zhou, Philipp Krähenbühl, Mathieu Aubry, Qi-Xing Huang, Alexei A. Efros:
Learning Dense Correspondence via 3D-Guided Cycle Consistency. 117-126
Shenlong Wang, Sean Ryan Fanello, Christoph Rhemann, Shahram Izadi, Pushmeet Kohli:
The Global Patch Collider. 127-135
Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony R. Dick, Ian D. Reid:
Joint Probabilistic Matching Using m-Best Solutions. 136-145
Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, Stan Z. Li:
Face Alignment Across Large Poses: A 3D Solution. 146-155
S1-1B: Segmentation and Contour Detection
Jie Feng, Brian L. Price, Scott Cohen, Shih-Fu Chang:
Interactive Segmentation on RGBD Images via Cue Selection. 156-164
Chen Liu, Pushmeet Kohli, Yasutaka Furukawa:
Layered Scene Decomposition via the Occlusion-CRF. 165-173
Michael Maire, Takuya Narihira, Stella X. Yu:
Affinity CNN: Learning Pixel-Centric Pairwise Relations for Figure/Ground Embedding. 174-182
Anna Khoreva, Rodrigo Benenson, Mohamed Omran, Matthias Hein, Bernt Schiele:
Weakly Supervised Object Boundaries. 183-192
Jimei Yang, Brian L. Price, Scott Cohen, Honglak Lee, Ming-Hsuan Yang:
Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. 193-202
Poster Session P1-1
Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony R. Dick, Anton van den Hengel:
What Value Do Explicit High Level Concepts Have in Vision to Language Problems? 203-212
Nati Ofir, Meirav Galun, Boaz Nadler, Ronen Basri:
Fast Detection of Curved Edges at Low SNR. 213-221
Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai:
Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs. 222-230
Huan Fu, Chaohui Wang, Dacheng Tao, Michael J. Black:
Occlusion Boundary Detection via Deep Exploration of Context. 241-250
Zizhao Zhang, Fuyong Xing, Xiaoshuang Shi, Lin Yang:
SemiContour: A Semi-Supervised Learning Approach for Contour Detection. 251-259
Lingxi Xie, Liang Zheng, Jingdong Wang, Alan L. Yuille, Qi Tian:
InterActive: Inter-Layer Activeness Propagation. 270-279
Hao Yang, Joey Tianyi Zhou, Yu Zhang, Bin-Bin Gao, Jianxin Wu, Jianfei Cai:
Exploit Bounding Box Annotations for Multi-Label Object Recognition. 280-288
Dmitry Laptev, Nikolay Savinov, Joachim M. Buhmann, Marc Pollefeys:
TI-POOLING: Transformation-Invariant Pooling for Feature Learning in Convolutional Neural Networks. 289-297
Edgar Simo-Serra, Hiroshi Ishikawa:
Fashion Style in 128 Floats: Joint Ranking and Classification Using Weak Data for Feature Extraction. 298-307
Yuhui Quan, Chenglong Bao, Hui Ji:
Equiangular Kernel Dictionary Learning with Applications to Dynamic Texture Analysis. 308-316
Tsun-Yi Yang, Yen-Yu Lin, Yung-Yu Chuang:
Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales. 327-335
Yuan-Ting Hu, Yen-Yu Lin:
Progressive Feature Matching with Alternate Descriptor Selection and Correspondence Enrichment. 346-354
Da Chen, Jean-Marie Mirebeau, Laurent D. Cohen:
A New Finsler Minimal Path Model with Curvature Penalization for Image Segmentation and Closed Contour Detection. 355-363
Yuhua Chen, Dengxin Dai, Jordi Pont-Tuset, Luc J. Van Gool:
Scale-Aware Alignment of Hierarchical Image Segmentation. 364-372
Ning Xu, Brian L. Price, Scott Cohen, Jimei Yang, Thomas S. Huang:
Deep Interactive Object Selection. 373-381
Danna Gurari, Suyog Dutt Jain, Margrit Betke, Kristen Grauman:
Pull the Plug? Predicting If Computers or Humans Should Segment Images. 382-391
Yuka Kihara, Matvey Soloviev, Tsuhan Chen:
In the Shadows, Shape Priors Shine: Using Occlusion to Improve Multi-region Segmentation. 392-401
Loïc Alain Royer, David L. Richmond, Carsten Rother, Bjoern Andres, Dagmar Kainmueller:
Convexity Shape Constraints for Image Segmentation. 402-410
Ertunc Erdil, Sinan Yildirim, Müjdat Çetin, Tolga Tasdizen:
MCMC Shape Sampling for Image Segmentation with Nonparametric Shape Priors. 411-419
Jaesik Park, Yu-Wing Tai, Sudipta N. Sinha, In-So Kweon:
Efficient and Robust Color Consistency for Community Photo Collections. 430-438
Kuldeep Kulkarni, Suhas Lohit, Pavan K. Turaga, Ronan Kerviche, Amit Ashok:
ReconNet: Non-Iterative Reconstruction of Images from Compressively Sensed Measurements. 449-458
Jin-shan Pan, Zhe Hu, Zhixun Su, Hsin-Ying Lee, Ming-Hsuan Yang:
Soft-Segmentation Guided Object Motion Deblurring. 459-468
Dongliang Cheng, Abdelrahman Kamel, Brian L. Price, Scott Cohen, Michael S. Brown:
Two Illuminant Estimation and User Correction Preference. 469-477
Seung-Hwan Baek, Inchang Choi, Min H. Kim:
Multiview Image Completion with Space Structure Propagation. 488-496
Jiansheng Chen, Gaocheng Bai, Shaoheng Liang, Zhengqin Li:
Automatic Image Cropping: A Computational Complexity Study. 507-515
Neil D. B. Bruce, Christopher Catton, Sasa Janjic:
A Deeper Look at Saliency: Feature Contrast, Semantics, and Beyond. 516-524
Qiaosong Wang, Wen Zheng, Robinson Piramuthu:
GraB: Visual Saliency via Novel Graph Model and Background Priors. 535-543
Anna Volokitin, Michael Gygli, Xavier Boix:
Predicting When Saliency Maps are Accurate and Eye Fixations Consistent. 544-552
Oriel Frigo, Neus Sabater, Julie Delon, Pierre Hellier:
Split and Match: Example-Based Adaptive Patch Sampling for Unsupervised Style Transfer. 553-561
Lilian Calvet, Pierre Gurdjos, Carsten Griwodz, Simone Gasparini:
Detection and Accurate Localization of Circular Fiducials under Highly Challenging Conditions. 562-570
Luis Herranz, Shuqiang Jiang, Xiangyang Li:
Scene Recognition with CNNs: Objects, Scales and Dataset Bias. 571-579
Nicholas Rhinehart, Kris Makoto Kitani:
Learning Action Maps of Large Environments via First-Person Vision. 580-588
Yingying Zhang, Desen Zhou, Siqin Chen, Shenghua Gao, Yi Ma:
Single-Image Crowd Counting via Multi-Column Convolutional Neural Network. 589-597
Junting Pan, Elisa Sayrol, Xavier Giró i Nieto, Kevin McGuinness, Noel E. O'Connor:
Shallow and Deep Convolutional Networks for Saliency Prediction. 598-606
Mohammad Najafi, Sarah Taghavi Namin, Mathieu Salzmann, Lars Petersson:
Sample and Filter: Nonparametric Scene Parsing via Efficient Filtering. 607-615
Saumitro Dasgupta, Kuan Fang, Kevin Chen, Silvio Savarese:
DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes. 616-624
Siyu Zhu, Richard Zanibbi:
A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification. 625-632
Xiaodan Liang, Yunchao Wei, Xiaohui Shen, Zequn Jie, Jiashi Feng, Liang Lin, Shuicheng Yan:
Reversible Recursive Instance-Level Object Segmentation. 633-641
Yao Lu, Xue Bai, Linda G. Shapiro, Jue Wang:
Coherent Parametric Contours for Interactive Video Object Segmentation. 642-650
Yong-Jin Liu, Cheng-Chi Yu, Minjing Yu, Ying He:
Manifold SLIC: A Fast Method to Compute Content-Sensitive Superpixels. 651-659
Gayoung Lee, Yu-Wing Tai, Junmo Kim:
Deep Saliency with Encoded Low Level Distance Map and High Level Features. 660-668
Ziyu Zhang, Sanja Fidler, Raquel Urtasun:
Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs. 669-677
Nian Liu, Junwei Han:
DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection. 678-686
Rong Quan, Junwei Han, Dingwen Zhang, Feiping Nie:
Object Co-segmentation via Graph Optimized-Flexible Manifold Ranking. 687-695
Won-Dong Jang, Chulwoo Lee, Chang-Su Kim:
Primary Object Segmentation in Videos via Alternate Convex Optimization of Foreground and Background Distributions. 696-704
Luca Del Pero, Susanna Ricco, Rahul Sukthankar, Vittorio Ferrari:
Discovering the Physical Parts of an Articulated Object Class from Multiple Videos. 714-723
Federico Perazzi, Jordi Pont-Tuset, Brian McWilliams, Luc J. Van Gool, Markus H. Gross, Alexander Sorkine-Hornung:
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation. 724-732
Mahmudul Hasan, Jonghyun Choi, Jan Neumann, Amit K. Roy-Chowdhury, Larry S. Davis:
Learning Temporal Regularity in Video Sequences. 733-742
Nicolas Marki, Federico Perazzi, Oliver Wang, Alexander Sorkine-Hornung:
Bilateral Space Video Segmentation. 743-751
Zhang Zhang, Kaiqi Huang, Tieniu Tan, Peipei Yang, Jun Li:
ReD-SFA: Relation Discovery Based Slow Feature Analysis for Trajectory Clustering. 752-760
Oral & Spotlight Session 1-2A
O1-2A: Object Recognition and Detection
Abhinav Shrivastava, Abhinav Gupta, Ross B. Girshick:
Training Region-Based Object Detectors with Online Hard Example Mining. 761-769
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun:
Deep Residual Learning for Image Recognition. 770-778
Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, Ali Farhadi:
You Only Look Once: Unified, Real-Time Object Detection. 779-788
Spyros Gidaris, Nikos Komodakis:
LocNet: Improving Localization Accuracy for Object Detection. 789-798
Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Chen Change Loy:
Sketch Me That Shoe. 799-807
S1-2A: Object Detection 1
Shuran Song, Jianxiong Xiao:
Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images. 808-816
Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Object Detection from Video Tubelets with Convolutional Neural Networks. 817-825
Judy Hoffman, Saurabh Gupta, Trevor Darrell:
Learning with Side Information through Modality Hallucination. 826-834
Neelima Chavali, Harsh Agrawal, Aroma Mahendru, Dhruv Batra:
Object-Proposal Evaluation Protocol is 'Gameable'. 835-844
Tao Kong, Anbang Yao, Yurong Chen, Fuchun Sun:
HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection. 845-853
Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari:
We Don't Need No Bounding-Boxes: Training Object Class Detectors Using Only Human Verification. 854-863
Wanli Ouyang, Xiaogang Wang, Cong Zhang, Xiaokang Yang:
Factors in Finetuning Deep Model for Object Detection with Long-Tail Distribution. 864-873
Oral & Spotlight Session 1-2B
O1-2B: Vision with Alternative Sensors
Guy Rosman, Daniela Rus, John W. Fisher III:
Information-Driven Adaptive Structured-Light Scanners. 874-883
Patrick Bardow, Andrew J. Davison, Stefan Leutenegger:
Simultaneous Optical Flow and Intensity Estimation from an Event Camera. 884-892
Achuta Kadambi, Jamie Schiel, Ramesh Raskar:
Macroscopic Interferometry: Rethinking Depth Estimation with Frequency-Domain Time-of-Flight. 893-902
Huaijin G. Chen, Suren Jayasuriya, Jiyue Yang, Judy Stephen, Sriram Sivaramakrishnan, Ashok Veeraraghavan, Alyosha C. Molnar:
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks Using Angle Sensitive Pixels. 903-912
Katherine L. Bouman, Michael D. Johnson, Daniel Zoran, Vincent L. Fish, Sheperd S. Doeleman, William T. Freeman:
Computational Imaging for VLBI Image Reconstruction. 913-922
S1-2B: Video Analysis 1
Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, Tao Mei:
You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images. 923-932
Fanyi Xiao, Yong Jae Lee:
Track and Segment: An Iterative Unsupervised Approach for Video Object Proposals. 933-942
Gao Zhu, Fatih Porikli, Hongdong Li:
Beyond Local Search: Tracking Objects Everywhere with Instance-Specific Proposals. 943-951
Hongkai Yu, Youjie Zhou, Jeff P. Simmons, Craig P. Przybyla, Yuewei Lin, Xiaochuan Fan, Yang Mi, Song Wang:
Groupwise Tracking of Crowded Similar-Appearance Targets from Low-Continuity Image Sequences. 952-960
Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Fei-Fei Li, Silvio Savarese:
Social LSTM: Human Trajectory Prediction in Crowded Spaces. 961-971
Andrii Maksai, Xinchao Wang, Pascal Fua:
What Players do with the Ball: A Physically Constrained Interaction Modeling. 972-981
Poster Session P1-2
Bugra Tekin, Artem Rozantsev, Vincent Lepetit, Pascal Fua:
Direct Prediction of 3D Body Poses from Motion Compensated Sequences. 991-1000
Michael Gygli, Yale Song, Liangliang Cao:
Video2GIF: Automatic Generation of Animated GIFs from Video. 1001-1009
Amir Shahroudy, Jun Liu, Tian-Tsong Ng, Gang Wang:
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis. 1010-1019
Bingbing Ni, Xiaokang Yang, Shenghua Gao:
Progressively Parsing Interactional Objects for Fine Grained Action Detection. 1020-1028
Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang:
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning. 1029-1038
Jingjing Meng, Hongxing Wang, Junsong Yuan, Yap-Peng Tan:
From Keyframes to Key Objects: Video Summarization by Representative Object Proposal Selection. 1039-1048
Zheng Shou, Dongang Wang, Shih-Fu Chang:
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs. 1049-1058
Ke Zhang, Wei-Lun Chao, Fei Sha, Kristen Grauman:
Summary Transfer: Exemplar-Based Subset Selection for Video Summarization. 1059-1067
Yeong Jun Koh, Won-Dong Jang, Chang-Su Kim:
POD: Discovering Primary Objects in Videos Based on Evolutionary Refinement of Object Recurrence, Background, and Primary Object Models. 1068-1076
Waqas Sultani, Mubarak Shah:
What If We Do Not have Multiple Videos of the Same Action? - Video Action Localization Using Web Images. 1077-1085
Lu Zhang, Hayley Hung:
Beyond F-Formations: Determining Social Involvement in Free Standing Conversing Groups from Static Images. 1086-1095
Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang:
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. 1096-1104
Hua Zhang, Si Liu, Changqing Zhang, Wenqi Ren, Rui Wang, Xiaochun Cao:
SketchNet: Sketch Classification with Web Images. 1105-1113
Xiaofan Zhang, Feng Zhou, Yuanqing Lin, Shaoting Zhang:
Embedding Label Structures for Fine-Grained Feature Representation. 1114-1123
Feng Zhou, Yuanqing Lin:
Fine-Grained Image Classification by Exploring Bipartite-Graph Labels. 1124-1133
Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, Qi Tian:
Picking Deep Filter Responses for Fine-Grained Image Recognition. 1134-1142
Han Zhang, Tao Xu, Mohamed Elhoseiny, Xiaolei Huang, Shaoting Zhang, Ahmed M. Elgammal, Dimitris N. Metaxas:
SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition. 1143-1152
Yin Cui, Feng Zhou, Yuanqing Lin, Serge J. Belongie:
Fine-Grained Categorization and Dataset Bootstrapping Using Deep Metric Learning with Humans in the Loop. 1153-1162
Yaming Wang, Jonghyun Choi, Vlad I. Morariu, Larry S. Davis:
Mining Discriminative Triplets of Patches for Fine-Grained Classification. 1163-1172
Shaoli Huang, Zhe Xu, Dacheng Tao, Ya Zhang:
Part-Stacked CNN for Fine-Grained Visual Categorization. 1173-1182
Kevin Lin, Jiwen Lu, Chu-Song Chen, Jie Zhou:
Learning Compact Binary Descriptors with Unsupervised Deep Neural Networks. 1183-1192
Kilho Son, Daniel Moreno, James Hays, David B. Cooper:
Solving Small-Piece Jigsaw Puzzles by Growing Consensus. 1193-1201
Zhen Zhang, Qinfeng Shi, Julian J. McAuley, Wei Wei, Yanning Zhang, Anton van den Hengel:
Pairwise Matching through Max-Weight Bipartite Belief Propagation. 1202-1210
Albert Haque, Alexandre Alahi, Li Fei-Fei:
Recurrent Attention Models for Depth-Based Person Identification. 1229-1238
Li Zhang, Tao Xiang, Shaogang Gong:
Learning a Discriminative Null Space for Person Re-identification. 1239-1248
Tong Xiao, Hongsheng Li, Wanli Ouyang, Xiaogang Wang:
Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification. 1249-1258
Shanshan Zhang, Rodrigo Benenson, Mohamed Omran, Jan Hendrik Hosang, Bernt Schiele:
How Far are We from Solving Pedestrian Detection? 1259-1267
Dapeng Chen, Zejian Yuan, Badong Chen, Nanning Zheng:
Similarity Learning with Spatial Constraints for Person Re-identification. 1268-1277
Ying Zhang, Baohua Li, Huchuan Lu, Atshushi Irie, Xiang Ruan:
Sample-Specific SVM Learning for Person Re-identification. 1278-1287
Faqiang Wang, Wangmeng Zuo, Liang Lin, David Zhang, Lei Zhang:
Joint Learning of Single-Image and Cross-Image Representations for Person Re-identification. 1288-1296
Haoxiang Li, Jonathan Brandt, Zhe Lin, Xiaohui Shen, Gang Hua:
A Multi-level Contextual Model for Person Recognition in Photo Albums. 1297-1305
Peixi Peng, Tao Xiang, Yaowei Wang, Massimiliano Pontil, Shaogang Gong, Tiejun Huang, Yonghong Tian:
Unsupervised Cross-Dataset Transfer Learning for Person Re-identification. 1306-1315
Jiale Cao, Yanwei Pang, Xuelong Li:
Pedestrian Detection Inspired by Appearance Constancy and Shape Symmetry. 1316-1324
Niall McLaughlin, Jesús Martínez del Rincón, Paul C. Miller:
Recurrent Convolutional Network for Video-Based Person Re-identification. 1325-1334
De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, Nanning Zheng:
Person Re-identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function. 1335-1344
Jinjie You, Ancong Wu, Xiang Li, Wei-Shi Zheng:
Top-Push Video-Based Person Re-identification. 1345-1353
Yeong-Jun Cho, Kuk-Jin Yoon:
Improving Person Re-identification via Pose-Aware Multi-shot Matching. 1354-1362
Tetsu Matsukawa, Takahiro Okabe, Einoshin Suzuki, Yoichi Sato:
Hierarchical Gaussian Descriptor for Person Re-identification. 1363-1372
Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu:
STCT: Sequentially Training Convolutional Networks for Visual Tracking. 1373-1381
Juan-Manuel Perez-Rua, Tomás Crivelli, Patrick Bouthemy, Patrick Pérez:
Determining Occlusions from Space and Time Image Reconstructions. 1382-1391
Ju Hong Yoon, Chang-Ryeol Lee, Ming-Hsuan Yang, Kuk-Jin Yoon:
Online Multi-object Tracking via Structural Constraint Event Aggregation. 1392-1400
Luca Bertinetto, Jack Valmadre, Stuart Golodetz, Ondrej Miksik, Philip H. S. Torr:
Staple: Complementary Learners for Real-Time Tracking. 1401-1409
Jiaolong Yang, Hongdong Li, Yuchao Dai, Robby T. Tan:
Robust Optical Flow Estimation of Double-Layer Images under Transparency or Reflection. 1410-1419
Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg:
Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking. 1430-1438
Adel Bibi, Tianzhu Zhang, Bernard Ghanem:
3D Part-Based Sparse Tracker with Automatic Synchronization and Registration. 1439-1448
Zhen Cui, Shengtao Xiao, Jiashi Feng, Shuicheng Yan:
Recurrently Target-Attending Tracking. 1449-1458
Maksim Lapin, Matthias Hein, Bernt Schiele:
Loss Functions for Top-k Error: Analysis and Insights. 1468-1477
Valentina Zantedeschi, Rémi Emonet, Marc Sebban:
Metric Learning as Convex Combinations of Local Models with Generalization Guarantees. 1478-1486
Ziming Zhang, Yuting Chen, Venkatesh Saligrama:
Efficient Training of Very Deep Neural Networks for Supervised Hashing. 1487-1495
Saeid Motiian, Marco Piccirilli, Donald A. Adjeroh, Gianfranco Doretto:
Information Bottleneck Learning Using Privileged Information for Visual Recognition. 1496-1505
Oral & Spotlight Session 2-1A
O2-1A: Recognition and Parsing in 3D
Zhile Ren, Erik B. Sudderth:
Three-Dimensional Object Detection and Layout Prediction Using Clouds of Oriented Gradients. 1525-1533
Iro Armeni, Ozan Sener, Amir Roshan Zamir, Helen Jiang, Ioannis K. Brilakis, Martin Fischer, Silvio Savarese:
3D Semantic Parsing of Large-Scale Indoor Spaces. 1534-1543
Lingyu Wei, Qixing Huang, Duygu Ceylan, Etienne Vouga, Hao Li:
Dense Human Body Correspondences Using Convolutional Networks. 1544-1553
S2-1A: Recognition Beyond Objects
Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton van den Hengel, Heng Tao Shen:
What's Wrong with That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution. 1573-1581
Torsten Sattler, Michal Havlena, Konrad Schindler, Marc Pollefeys:
Large-Scale Location Recognition and the Geometric Burstiness Problem. 1582-1590
Mark Wolff, Robert T. Collins, Yanxi Liu:
Regularity-Driven Building Facade Matching between Aerial and Street Views. 1591-1600
R. T. Pramod, S. P. Arun:
Do Computational Models Differ Systematically from Human Object Perception? 1601-1609
Oral & Spotlight Session 2-1B
O2-1B: Image Processing and Restoration
Timo Hackel, Jan Dirk Wegner, Konrad Schindler:
Contour Detection in Unstructured 3D Point Clouds. 1610-1618
Jin-shan Pan, Deqing Sun, Hanspeter Pfister, Ming-Hsuan Yang:
Blind Image Deblurring Using Dark Channel Prior. 1628-1636
Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee:
Deeply-Recursive Convolutional Network for Image Super-Resolution. 1637-1645
Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee:
Accurate Image Super-Resolution Using Very Deep Convolutional Networks. 1646-1654
S2-1B: Image Processing and Restoration
Nguyen Ho Man Rang, Michael S. Brown:
RAW Image Reconstruction Using a Self-Contained sRGB-JPEG Image with Only 64 KB Overhead. 1655-1663
Kede Ma, Qingbo Wu, Zhou Wang, Zhengfang Duanmu, Hongwei Yong, Hongliang Li, Lei Zhang:
Group MAD Competition - A New Methodology to Compare Objective Image Quality Models. 1664-1673
Seonghyeon Nam, Youngbae Hwang, Yasuyuki Matsushita, Seon Joo Kim:
A Holistic Approach to Cross-Channel Image Noise Modeling and Its Application to Image Denoising. 1683-1691
Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu, Shuhang Gu, Wangmeng Zuo, Lei Zhang:
Multispectral Images Denoising by Intrinsic Tensor Sparsity Regularization. 1692-1700
Wei-Sheng Lai, Jia-Bin Huang, Zhe Hu, Narendra Ahuja, Ming-Hsuan Yang:
A Comparative Study for Single Image Blind Deblurring. 1701-1709
Poster Session P2-1
Minh Vo, Srinivasa G. Narasimhan, Yaser Sheikh:
Spatiotemporal Bundle Adjustment for Dynamic 3D Reconstruction. 1710-1718
Ajad Chhatkuli, Daniel Pizarro, Toby Collins, Adrien Bartoli:
Inextensible Non-Rigid Shape-from-Motion by Second-Order Cone Programming. 1719-1727
Johan Fredriksson, Viktor Larsson, Carl Olsson, Fredrik Kahl:
Optimal Relative Pose with Unknown Correspondences. 1728-1736
Haifei Huang, Hui Zhang, Yiu-Ming Cheung:
Homography Estimation from the Common Self-Polar Triangle of Separate Ellipses. 1737-1744
Anders P. Eriksson, John Bastian, Tat-Jun Chin, Mats Isaksson:
A Consensus-Based Framework for Distributed Bundle Adjustment. 1754-1762
Kyungdon Joo, Tae-Hyun Oh, Jun-Sik Kim, In-So Kweon:
Globally Optimal Manhattan Frame Estimation in Real-Time. 1763-1771
Kai Han, Kwan-Yee K. Wong, Dirk Schnieders, Miaomiao Liu:
Mirror Surface Reconstruction under an Uncalibrated Camera. 1772-1780
Guibo Luo, Yuesheng Zhu, Zhaotian Li, Liming Zhang:
A Hole Filling Approach Based on Background Reconstruction for View Synthesis in 3D Video. 1781-1789
Yinqiang Zheng, Laurent Kneip:
A Direct Least-Squares Solution to the PnP Problem with Unknown Focal Length. 1790-1798
Zuzana Kukelova, Jan Heller, Andrew W. Fitzgibbon:
Efficient Intersection of Three Quadrics and Applications in Computer Vision. 1799-1808
Lior Talker, Yael Moses, Ilan Shimshoni:
Using Spatial Order to Boost the Elimination of Incorrect Feature Matches. 1809-1817
Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan, Michael Felsberg:
A Probabilistic Framework for Color-Based Point Set Registration. 1818-1826
Dong Gong, Mingkui Tan, Yanning Zhang, Anton van den Hengel, Qinfeng Shi:
Blind Image Deconvolution by Automatic Gradient Activation. 1827-1836
Eduardo Perez-Pellitero, Jordi Salvador, Javier Ruiz Hidalgo, Bodo Rosenhahn:
PSyCo: Manifold Span Reduction for Super Resolution. 1837-1845
Zhe Hu, Lu Yuan, Stephen Lin, Ming-Hsuan Yang:
Image Deblurring Using Smartphone Inertial Sensors. 1855-1864
Radu Timofte, Rasmus Rothe, Luc Van Gool:
Seven Ways to Improve Example-Based Single Image Super Resolution. 1865-1873
Wenzhe Shi, Jose Caballero, Ferenc Huszar, Johannes Totz, Andrew P. Aitken, Rob Bishop, Daniel Rueckert, Zehan Wang:
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. 1874-1883
Xiaojun Chang, Yaoliang Yu, Yi Yang, Eric P. Xing:
They are Not Equally Reliable: Semantic Event Search Using Differentiated Concept Classifiers. 1884-1893
Minghuang Ma, Haoqi Fan, Kris M. Kitani:
Going Deeper into First-Person Activity Recognition. 1894-1903
Yang Zhou, Bingbing Ni, Richang Hong, Xiaokang Yang, Qi Tian:
Cascaded Interactional Targeting Network for Egocentric Video Analysis. 1904-1913
Fabian Caba Heilbron, Juan Carlos Niebles, Bernard Ghanem:
Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos. 1914-1923
Basura Fernando, Peter Anderson, Marcus Hutter, Stephen Gould:
Discriminative Hierarchical Rank Pooling for Activity Recognition. 1924-1932
Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman:
Convolutional Two-Stream Network Fusion for Video Action Recognition. 1933-1941
Shugao Ma, Leonid Sigal, Stan Sclaroff:
Learning Activity Progression in LSTMs for Activity Detection and Early Detection. 1942-1950
Yingwei Li, Weixin Li, Vijay Mahadevan, Nuno Vasconcelos:
VLAD3: Encoding Dynamics of Deep Features for Action Recognition. 1951-1960
Bharat Singh, Tim K. Marks, Michael J. Jones, Oncel Tuzel, Ming Shao:
A Multi-stream Bi-directional Recurrent Neural Network for Fine-Grained Action Detection. 1961-1970
Mostafa S. Ibrahim, Srikanth Muralidharan, Zhiwei Deng, Arash Vahdat, Greg Mori:
A Hierarchical Deep Temporal Model for Group Activity Recognition. 1971-1980
Ivan Lillo, Juan Carlos Niebles, Alvaro Soto:
A Hierarchical Pose-Based Approach to Complex Action Understanding Using Dictionaries of Actionlets and Motion Poselets. 1981-1990
Wangjiang Zhu, Jie Hu, Gang Sun, Xudong Cao, Yu Qiao:
A Key Volume Mining Deep Framework for Action Recognition. 1991-1999
Eng-Jon Ong, Miroslaw Bober:
Improved Hamming Distance Search Using Variable Length Hashing. 2000-2008
Jae-Pil Heo, Zhe Lin, Xiaohui Shen, Jonathan Brandt, Sung-Eui Yoon:
Shortlist Selection with Residual-Aware Distance Estimator for K-Nearest Neighbor Search. 2009-2017
Xiaojuan Wang, Ting Zhang, Guo-Jun Qi, Jinhui Tang, Jingdong Wang:
Supervised Quantization for Similarity Search. 2018-2026
Patrick Wieschollek, Oliver Wang, Alexander Sorkine-Hornung, Hendrik P. A. Lensch:
Efficient Large-Scale Approximate Nearest Neighbor Search on the GPU. 2027-2035
Thi Quynh Nhi Tran, Hervé Le Borgne, Michel Crucianu:
Aggregating Image and Text Quantized Correlated Components. 2046-2054
Artem Babenko, Victor S. Lempitsky:
Efficient Indexing of Billion-Scale Datasets of Deep Descriptors. 2055-2063
Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen:
Deep Supervised Hashing for Fast Image Retrieval. 2064-2072
Ahmet Iscen, Michael G. Rabbat, Teddy Furon:
Efficient Large-Scale Similarity Search Using Matrix Factorization. 2073-2081
Theodora Kontogianni, Markus Mathias, Bastian Leibe:
Incremental Object Discovery in Time-Varying Image Collections. 2082-2090
Jia-Bin Huang, Rich Caruana, Andrew Farnsworth, Steve Kelling, Narendra Ahuja:
Detecting Migrating Birds at Night. 2091-2099
Ilja Kuzborskij, Fabio Maria Carlucci, Barbara Caputo:
When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks. 2100-2109
Zhe Zhu, Dun Liang, Song-Hai Zhang, Xiaolei Huang, Baoli Li, Shi-Min Hu:
Traffic-Sign Detection and Classification in the Wild. 2110-2118
Yuxing Tang, Josiah Wang, Boyang Gao, Emmanuel Dellandréa, Robert J. Gaizauskas, Liming Chen:
Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer. 2119-2128
Fan Yang, Wongun Choi, Yuanqing Lin:
Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers. 2129-2137
Keze Wang, Liang Lin, Wangmeng Zuo, Shuhang Gu, Lei Zhang:
Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection. 2138-2146
Xiaozhi Chen, Kaustav Kundu, Ziyu Zhang, Huimin Ma, Sanja Fidler, Raquel Urtasun:
Monocular 3D Object Detection for Autonomous Driving. 2147-2156
Radu Tudor Ionescu, Bogdan Alexe, Marius Leordeanu, Marius Popescu, Dim P. Papadopoulos, Vittorio Ferrari:
How Hard Can It Be? Estimating the Difficulty of Visual Search in an Image. 2157-2166
Hongye Liu, Yonghong Tian, Yaowei Wang, Lu Pang, Tiejun Huang:
Deep Relative Distance Learning: Tell the Difference between Similar Vehicles. 2167-2175
Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra M. Bhandarkar, Wojciech Matusik, Antonio Torralba:
Eye Tracking for Everyone. 2176-2184
Zorah Lähner, Emanuele Rodolà, Frank R. Schmidt, Michael M. Bronstein, Daniel Cremers:
Efficient Globally Optimal 2D-to-3D Deformable Shape Matching. 2185-2193
Viktoriia Sharmanska, Daniel Hernández-Lobato, José Miguel Hernández-Lobato, Novi Quadrianto:
Ambiguity Helps: Classification with Disagreements in Crowdsourced Annotations. 2194-2202
Roozbeh Mottaghi, Hannaneh Hajishirzi, Ali Farhadi:
A Task-Oriented Approach for Cost-Sensitive Recognition. 2203-2211
Sukrit Shankar, Duncan P. Robertson, Yani Ioannou, Antonio Criminisi, Roberto Cipolla:
Refining Architectures of Deep Convolutional Neural Networks. 2212-2220
Ali Borji, Saeed Izadi, Laurent Itti:
iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning. 2221-2230
Chen-Yu Lee, Simon Osindero:
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild. 2231-2239
Venkatesh N. Murthy, Vivek Singh, Terrence Chen, R. Manmatha, Dorin Comaniciu:
Deep Decision Network for Multi-class Image Classification. 2240-2248
Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton van den Hengel:
Less is More: Zero-Shot Learning from Online Textual Documents with Noise Suppression. 2249-2257
Wen Li, Dengxin Dai, Mingkui Tan, Dong Xu, Luc Van Gool:
Fast Algorithms for Linear and Kernel SVM+. 2258-2266
Oral & Spotlight Session 2-2A
O2-2A: Recognition and Labeling
Liang Lin, Guangrun Wang, Rui Zhang, Ruimao Zhang, Xiaodan Liang, Wangmeng Zuo:
Deep Structured Scene Parsing by Learning with Image Descriptions. 2276-2284
Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu:
CNN-RNN: A Unified Framework for Multi-label Image Classification. 2285-2294
Jing Wang, Yu Cheng, Rogério Schmidt Feris:
Walk and Learn: Facial Attribute Representation Learning from Egocentric Video and Contextual Data. 2295-2304
S2-2A: Object Detection 2
Ankush Gupta, Andrea Vedaldi, Andrew Zisserman:
Synthetic Data for Text Localisation in Natural Images. 2315-2324
Russell Stewart, Mykhaylo Andriluka, Andrew Y. Ng:
End-to-End People Detection in Crowded Scenes. 2325-2333
Wei-Chih Tu, Shengfeng He, Qingxiong Yang, Shao-Yi Chien:
Real-Time Salient Object Detection with a Minimum Spanning Tree. 2334-2342
David Feng, Nick Barnes, Shaodi You, Chris McCarthy:
Local Background Enclosure for RGB-D Salient Object Detection. 2343-2350
Yongxi Lu, Tara Javidi, Svetlana Lazebnik:
Adaptive Object Detection Using Adjacency and Zoom Prediction. 2351-2359
Mahyar Najibi, Mohammad Rastegari, Larry S. Davis:
G-CNN: An Iterative Grid Based Object Detector. 2369-2377
Oral & Spotlight Session 2-2B
O2-2B: Computational Photography and Faces
Wei Wang, Zhen Cui, Yan Yan, Jiashi Feng, Shuicheng Yan, Xiangbo Shu, Nicu Sebe:
Recurrent Face Aging. 2378-2386
Justus Thies, Michael Zollhöfer, Marc Stamminger, Christian Theobalt, Matthias Nießner:
Face2Face: Real-Time Face Capture and Reenactment of RGB Videos. 2387-2395
Sergey Tulyakov, Xavier Alameda-Pineda, Elisa Ricci, Lijun Yin, Jeffrey F. Cohn, Nicu Sebe:
Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions. 2396-2404
Andrew Owens, Phillip Isola, Josh H. McDermott, Antonio Torralba, Edward H. Adelson, William T. Freeman:
Visually Indicated Sounds. 2405-2413
Leon A. Gatys, Alexander S. Ecker, Matthias Bethge:
Image Style Transfer Using Convolutional Neural Networks. 2414-2423
S2-2B: Computational Photography and Biomedical Applications
Le Hou, Dimitris Samaras, Tahsin M. Kurç, Yi Gao, James E. Davis, Joel H. Saltz:
Patch-Based Convolutional Neural Network for Whole Slide Tissue Image Classification. 2424-2433
Hossam N. Isack, Olga Veksler, Milan Sonka, Yuri Boykov:
Hedgehog Shape Priors for Multi-Object Segmentation. 2434-2442
Won Hwa Kim, Hyunwoo J. Kim, Nagesh Adluru, Vikas Singh:
Latent Variable Graphical Model Selection Using Harmonic Analysis: Applications to the Human Connectome Project (HCP). 2443-2451
Gyeongmin Choe, Srinivasa G. Narasimhan, In-So Kweon:
Simultaneous Estimation of Near IR BRDF and Fine-Scale Surface Geometry. 2452-2460
Seoung Wug Oh, Michael S. Brown, Marc Pollefeys, Seon Joo Kim:
Do It Yourself Hyperspectral Imaging with Everyday Digital Cameras. 2461-2469
Joon-Young Lee, Kalyan Sunkavalli, Zhe Lin, Xiaohui Shen, In-So Kweon:
Automatic Content-Aware Color and Tone Stylization. 2470-2478
Chuan Li, Michael Wand:
Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis. 2479-2486
Poster Session P2-2
Hao Chen, Xiaojuan Qi, Lequan Yu, Pheng-Ann Heng:
DCAN: Deep Contour-Aware Networks for Accurate Gland Segmentation. 2487-2496
Hoo-Chang Shin, Kirk Roberts, Le Lu, Dina Demner-Fushman, Jianhua Yao, Ronald M. Summers:
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation. 2497-2506
Huu Le, Tat-Jun Chin, David Suter:
Conformal Surface Alignment with Optimal Möbius Search. 2507-2516
Seong Jae Hwang, Nagesh Adluru, Maxwell D. Collins, Sathya N. Ravi, Barbara B. Bendlin, Sterling C. Johnson, Vikas Singh:
Coupled Harmonic Bases for Longitudinal Characterization of Brain Networks. 2517-2525
Jae Y. Shin, Nima Tajbakhsh, R. Todd Hurst, Christopher B. Kendall, Jianming Liang:
Automating Carotid Intima-Media Thickness Video Interpretation with Convolutional Neural Networks. 2526-2535
Deepak Pathak, Philipp Krähenbühl, Jeff Donahue, Trevor Darrell, Alexei A. Efros:
Context Encoders: Feature Learning by Inpainting. 2536-2544
Chenyi Lei, Dong Liu, Weiping Li, Zheng-Jun Zha, Houqiang Li:
Comparative Deep Learning of Hybrid Representations for Image Recommendations. 2545-2553
Zeeshan Hayder, Xuming He, Mathieu Salzmann:
Learning to Co-Generate Object Proposals with a Deep Structured Network. 2565-2573
Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Pascal Frossard:
DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks. 2574-2582
Calvin Murdock, Zhen Li, Howard Zhou, Tom Duerig:
Blockout: Dynamic Model Selection for Hierarchical Deep Networks. 2583-2591
Forrest N. Iandola, Matthew W. Moskewicz, Khalid Ashraf, Kurt Keutzer:
FireCaffe: Near-Linear Acceleration of Deep Neural Network Training on Compute Clusters. 2592-2600
Sarah Rastegar, Mahdieh Soleymani Baghshah, Hamid R. Rabiee, Seyed Mohsen Shojaee:
MDL-CW: A Multimodal Deep Learning Framework with CrossWeights. 2601-2609
Jörn-Henrik Jacobsen, Jan C. van Gemert, Zhongyu Lou, Arnold W. M. Smeulders:
Structured Receptive Fields in CNNs. 2610-2619
Suriya Singh, Chetan Arora, C. V. Jawahar:
First Person Action Recognition Using Deep Learned Descriptors. 2620-2628
Ryo Yonetani, Kris M. Kitani, Yoichi Sato:
Recognizing Micro-Actions and Reactions from Paired Egocentric Videos. 2629-2638
Chun-yu Wang, Yizhou Wang, Alan L. Yuille:
Mining 3D Key-Pose-Motifs for Action Recognition. 2639-2647
Khurram Soomro, Haroon Idrees, Mubarak Shah:
Predicting the Where and What of Actors and Actions through Online Action Localization. 2648-2657
Young Joon Yoo, Kimin Yun, Sangdoo Yun, Jonghee Hong, Hawook Jeong, Jin Young Choi:
Visual Path Prediction in Complex Scenes with Crowded Moving Objects. 2668-2677
Serena Yeung, Olga Russakovsky, Greg Mori, Li Fei-Fei:
End-to-End Learning of Action Detection from Frame Glimpses in Videos. 2678-2687
Analí Alfaro, Domingo Mery, Alvaro Soto:
Action Recognition in Video Using Sparse Coding and Relative Features. 2688-2697
Limin Wang, Yu Qiao, Xiaoou Tang, Luc J. Van Gool:
Actionness Estimation Using Hybrid Fully Convolutional Networks. 2708-2717
Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-Time Action Recognition with Enhanced Motion Vector CNNs. 2718-2726
Yu Li, Robby T. Tan, Xiaojie Guo, Jiangbo Lu, Michael S. Brown:
Rain Streak Removal Using Layer Priors. 2736-2744
Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi:
Gradient-Domain Image Reconstruction Framework with Intensity-Range and Base-Structure Constraints. 2745-2753
Jialei Wang, Peder A. Olsen, Andrew R. Conn, Aurelie C. Lozano:
Removing Clouds and Recovering Ground Observations in Satellite Image Sequences via Temporally Contiguous Robust Matrix Completion. 2754-2763
Zhangyang Wang, Ding Liu, Shiyu Chang, Qing Ling, Yingzhen Yang, Thomas S. Huang:
D3: Deep Dual-Domain Based Fast Restoration of JPEG-Compressed Images. 2764-2772
Vijay Rengarajan, A. N. Rajagopalan, Rangarajan Aravind:
From Bows to Arrows: Rolling Shutter Rectification of Urban Scenes. 2773-2781
Xueyang Fu, Delu Zeng, Yue Huang, Xiao-Ping Zhang, Xinghao Ding:
A Weighted Variational Model for Simultaneous Reflectance and Illumination Estimation. 2782-2790
Jin-shan Pan, Zhouchen Lin, Zhixun Su, Ming-Hsuan Yang:
Robust Kernel Estimation with Outliers Handling for Image Deblurring. 2800-2808
Hanwang Zhang, Xindi Shang, Wenzhuo Yang, Huan Xu, Huan-Bo Luan, Tat-Seng Chua:
Online Collaborative Learning for Open-Vocabulary Visual Classifiers. 2809-2817
Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, Zbigniew Wojna:
Rethinking the Inception Architecture for Computer Vision. 2818-2826
Saurabh Gupta, Judy Hoffman, Jitendra Malik:
Cross Modal Distillation for Supervision Transfer. 2827-2836
Trung T. Pham, Seyed Hamid Rezatofighi, Ian D. Reid, Tat-Jun Chin:
Efficient Point Process Inference for Large-Scale Object Detection. 2837-2845
Jacob Chan, Jimmy Addison Lee, Kemao Qian:
BORDER: An Oriented Rectangles Approach to Texture-Less Object Recognition. 2855-2863
Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross B. Girshick:
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks. 2874-2883
Gong Cheng, Peicheng Zhou, Junwei Han:
RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection. 2884-2893
Stefan Mathe, Aleksis Pirinen, Cristian Sminchisescu:
Reinforcement Learning for Visual Object Detection. 2894-2902
Inbar Huberman, Raanan Fattal:
Detecting Repeating Objects Using Patch Correlation Analysis. 2903-2911
Sebastian Lapuschkin, Alexander Binder, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek:
Analyzing Classifiers: Fisher Vectors and Deep Neural Networks. 2912-2920
Bolei Zhou, Aditya Khosla, Àgata Lapedriza, Aude Oliva, Antonio Torralba:
Learning Deep Features for Discriminative Localization. 2921-2929
Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross B. Girshick:
Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels. 2930-2939
Lluis Castrejon, Yusuf Aytar, Carl Vondrick, Hamed Pirsiavash, Antonio Torralba:
Learning Aligned Cross-Modal Representations from Weakly Aligned Data. 2940-2949
Sijia Cai, Lei Zhang, Wangmeng Zuo, Xiangchu Feng:
A Probabilistic Collaborative Representation Based Approach for Pattern Classification. 2950-2959
Hexiang Hu, Guang-Tong Zhou, Zhiwei Deng, Zicheng Liao, Greg Mori:
Learning Structured Inference Neural Networks with Label Relations. 2960-2968
Hongyuan Zhu, Jean-Baptiste Weibel, Shijian Lu:
Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition. 2969-2976
Qiang Li, Maoying Qiao, Wei Bian, Dacheng Tao:
Conditional Graphical Lasso for Multi-label Image Classification. 2977-2986
Carl Vondrick, Deniz Oktay, Hamed Pirsiavash, Antonio Torralba:
Predicting Motivations of Actions by Leveraging Text. 2997-3005
Jakub Sochor, Adam Herout, Jiří Havel:
BoxCars: 3D Boxes as CNN Input for Improved Fine-Grained Vehicle Recognition. 3006-3015
Xu Liu, Zilei Wang, Jiashi Feng, Hongsheng Xi:
Highway Vehicle Counting in Compressed Domain. 3016-3024
Shiyao Huang, Xianghua Ying, Jiangpeng Rong, Zeyu Shang, Hongbin Zha:
Camera Calibration from Periodic Motion of a Pedestrian. 3025-3033
Oral & Spotlight Session 3-1A
O3-1A: Actions and Human Pose
Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, Stephen Gould:
Dynamic Image Networks for Action Recognition. 3034-3042
Vignesh Ramanathan, Jonathan Huang, Sami Abu-El-Haija, Alexander N. Gorban, Kevin Murphy, Li Fei-Fei:
Detecting Events and Key Actors in Multi-person Videos. 3043-3053
Behrooz Mahasseni, Sinisa Todorovic:
Regularizing Long Short Term Memory with 3D Human-Skeleton Sequences for Action Recognition. 3054-3062
James Charles, Tomas Pfister, Derek R. Magee, David C. Hogg, Andrew Zisserman:
Personalizing Human Video Pose Estimation. 3063-3072
Wei Yang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation. 3073-3082
S3-1A: Activity Recognition
Chenliang Xu, Jason J. Corso:
Actor-Action Semantic Segmentation with Grouping Process Models. 3083-3092
Jun Yuan, Bingbing Ni, Xiaokang Yang, Ashraf A. Kassim:
Temporal Action Localization with Pyramid of Score Distribution Features. 3093-3102
Katsunori Ohnishi, Atsushi Kanehira, Asako Kanezaki, Tatsuya Harada:
Recognizing Activities of Daily Living with a Wrist-Mounted Camera. 3103-3111
Zuxuan Wu, Yanwei Fu, Yu-Gang Jiang, Leonid Sigal:
Harnessing Object and Scene Semantics for Large-Scale Video Understanding. 3112-3121
Alexander Richard, Juergen Gall:
Temporal Action Detection Using a Statistical Language Model. 3131-3140
Oral & Spotlight Session 3-1B
O3-1B: Semantic Segmentation
Shu Liu, Xiaojuan Qi, Jianping Shi, Hong Zhang, Jiaya Jia:
Multi-scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation. 3141-3149
Jifeng Dai, Kaiming He, Jian Sun:
Instance-Aware Semantic Segmentation via Multi-task Network Cascades. 3150-3158
Di Lin, Jifeng Dai, Jiaya Jia, Kaiming He, Jian Sun:
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation. 3159-3167
Abhijit Kundu, Vibhav Vineet, Vladlen Koltun:
Feature Space Optimization for Semantic Video Segmentation. 3168-3175
Maros Blaha, Christoph Vogel, Audrey Richard, Jan Dirk Wegner, Thomas Pock, Konrad Schindler:
Large-Scale Semantic 3D Reconstruction: An Adaptive Multi-resolution Model for Multi-class Volumetric Labeling. 3176-3184
S3-1B: Semantic Parsing and Segmentation
Xiaodan Liang, Xiaohui Shen, Donglai Xiang, Jiashi Feng, Liang Lin, Shuicheng Yan:
Semantic Object Parsing with Local-Global Long Short-Term Memory. 3185-3193
Guosheng Lin, Chunhua Shen, Anton van den Hengel, Ian D. Reid:
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation. 3194-3203
Seunghoon Hong, Junhyuk Oh, Honglak Lee, Bohyung Han:
Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network. 3204-3212
Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele:
The Cityscapes Dataset for Semantic Urban Scene Understanding. 3213-3223
Raviteja Vemulapalli, Oncel Tuzel, Ming-Yu Liu, Rama Chellappa:
Gaussian Conditional Random Field Network for Semantic Segmentation. 3224-3233
Germán Ros, Laura Sellart, Joanna Materzynska, David Vázquez, Antonio M. López:
The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes. 3234-3243
Poster Session P3-1
Angjoo Kanazawa, David W. Jacobs, Manmohan Chandraker:
WarpNet: Weakly Supervised Matching for Single-View Reconstruction. 3253-3261
Ole Johannsen, Antonin Sulc, Bastian Goldluecke:
What Sparse Light Field Coding Reveals about Scene Structure. 3262-3270
Hao Wang, Jun Wang, Liang Wang:
Online Reconstruction of Indoor Scenes from RGB-D Streams. 3271-3279
Ali Osman Ulusoy, Michael J. Black, Andreas Geiger:
Patches, Planes and Probabilities: A Non-Local Prior for Volumetric 3D Reconstruction. 3280-3289
Ian Schillebeeckx, Robert Pless:
Single Image Camera Calibration with Lenticular Arrays for Augmented Reality. 3290-3298
Diego Thomas, Rin-ichiro Taniguchi:
Augmented Blendshapes for Real-Time Simultaneous 3D Head Modeling and Facial Motion Capture. 3299-3308
Cédric Verleysen, Christophe De Vleeschouwer:
Piecewise-Planar 3D Approximation from Wide-Baseline Stereo. 3327-3336
Olivier Saurer, Marc Pollefeys, Gim Hee Lee:
Sparse to Dense 3D Reconstruction from Rolling Shutter Images. 3337-3345
Cenek Albl, Zuzana Kukelova, Tomás Pajdla:
Rolling Shutter Absolute Pose Problem with Known Vertical Direction. 3355-3363
Eric Brachmann, Frank Michel, Alexander Krull, Michael Ying Yang, Stefan Gumhold, Carsten Rother:
Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image. 3364-3372
Andrey Bushnevskiy, Lorenzo Sorgi, Bodo Rosenhahn:
Multicamera Calibration from Visible and Mirrored Epipoles. 3373-3381
Lazaros Zafeiriou, Epameinondas Antonakos, Stefanos Zafeiriou, Maja Pantic:
Joint Unsupervised Deformable Spatio-Temporal Alignment of Sequences. 3382-3390
Kaili Zhao, Wen-Sheng Chu, Honggang Zhang:
Deep Region and Multi-label Learning for Facial Action Unit Detection. 3391-3399
Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang:
Unconstrained Face Alignment via Cascaded Compositional Learning. 3409-3417
Marcel Piotraschke, Volker Blanz:
Automated 3D Face Reconstruction from Multiple Images Using Quality Measures. 3418-3427
Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen:
Occlusion-Free Face Alignment: Deep Regression Networks Coupled with De-Corrupt AutoEncoders. 3428-3437
Zheng Zhang, Jeffrey M. Girard, Yue Wu, Xing Zhang, Peng Liu, Umur A. Ciftci, Shaun J. Canavan, Michael Reale, Andrew Horowitz, Huiyuan Yang, Jeffrey F. Cohn, Qiang Ji, Lijun Yin:
Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis. 3438-3446
Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu:
Joint Training of Cascaded CNN for Face Detection. 3456-3465
Rui Zhao, Quan Gan, Shangfei Wang, Qiang Ji:
Facial Expression Intensity Estimation Using Ordinal Information. 3466-3474
Chen Sun, Manohar Paluri, Ronan Collobert, Ram Nevatia, Lubomir D. Bourdev:
ProNet: Learning to Propose Object-Specific Boxes for Cascaded Neural Networks. 3485-3493
Christopher Thomas, Adriana Kovashka:
Seeing Behind the Camera: Identifying the Authorship of a Photograph. 3494-3502
Shuochen Su, Felix Heide, Robin Swanson, Jonathan Klein, Clara Callenberg, Matthias B. Hullin, Wolfgang Heidrich:
Material Classification Using Raw Time-of-Flight Measurements. 3503-3511
Dong Li, Jia-Bin Huang, Yali Li, Shengjin Wang, Ming-Hsuan Yang:
Weakly Supervised Object Localization with Progressive Domain Adaptation. 3512-3520
Roozbeh Mottaghi, Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi:
Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images. 3521-3529
Ali Harakeh, Daniel C. Asmar, Elie A. Shammas:
Identifying Good Training Data for Self-Supervised Free Space Estimation. 3530-3538
Hani Altwaijry, Eduard Trulls, James Hays, Pascal Fua, Serge J. Belongie:
Learning to Match Aerial Images with Deep Attentive Architectures. 3539-3547
Krishna Kumar Singh, Fanyi Xiao, Yong Jae Lee:
Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection. 3548-3556
Ali Diba, Ali Mohammad Pazandeh, Hamed Pirsiavash, Luc Van Gool:
DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns. 3557-3565
Hojin Cho, Myung-Chul Sung, Bongjin Jun:
Canny Text Detector: Fast and Robust Scene Text Localization Algorithm. 3566-3573
Di Hu, Xuelong Li, Xiaoqiang Lu:
Temporal Multimodal Learning in Audiovisual Speech Recognition. 3574-3582
Andreas Doumanoglou, Rigas Kouskouridas, Sotiris Malassiotis, Tae-Kyun Kim:
Recovering 6D Object Pose and Predicting Next-Best-View in the Crowd. 3583-3592
Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann:
Robust 3D Hand Pose Estimation in Single Depth Images: From Single-View CNN to Multi-View CNNs. 3593-3601
Gedas Bertasius, Jianbo Shi, Lorenzo Torresani:
Semantic Segmentation with Boundary Neural Fields. 3602-3610
Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun:
HD Maps: Fine-Grained Road Segmentation by Parsing Ground and Aerial Images. 3611-3619
Bing Shuai, Zhen Zuo, Bing Wang, Gang Wang:
DAG-Recurrent Neural Networks for Scene Labeling. 3620-3629
Baisheng Lai, Xiaojin Gong:
Saliency Guided Dictionary Learning for Weakly-Supervised Image Parsing. 3630-3639
Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, Alan L. Yuille:
Attention to Scale: Scale-Aware Semantic Image Segmentation. 3640-3649
Jason Kuen, Zhenhua Wang, Gang Wang:
Recurrent Attentional Networks for Saliency Detection. 3668-3677
Guillaume Seguin, Piotr Bojanowski, Rémi Lajugie, Ivan Laptev:
Instance-Level Video Segmentation from Object Tracks. 3678-3687
Jun Xie, Martin Kiefel, Ming-Ting Sun, Andreas Geiger:
Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer. 3688-3697
Amir Kolaman, Maxim Lvov, Rami R. Hagege, Hugo Guterman:
Amplitude Modulated Video Camera - Light Separation in Dynamic Scenes. 3698-3706
Boxin Shi, Zhe Wu, Zhipeng Mo, Dinglong Duan, Sai-Kit Yeung, Ping Tan:
A Benchmark Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo. 3707-3716
Ting-Chun Wang, Manohar Srikanth, Ravi Ramamoorthi:
Depth from Semi-Calibrated Stereo and Defocus. 3717-3726
Ying Fu, Yinqiang Zheng, Imari Sato, Yoichi Sato:
Exploiting Spectral-Spatial Correlation for Coded Hyperspectral Image Restoration. 3727-3736
Julie Chang, Isaac Kauvar, Xuemei Hu, Gordon Wetzstein:
Variable Aperture Light Field Photography: Overcoming the Diffraction-Limited Spatio-Angular Resolution Tradeoff. 3737-3745
Rajat Aggarwal, Amrisha Vohra, Anoop M. Namboodiri:
Panoramic Stereo Videos with a Single Camera. 3755-3763
Yoshie Kobayashi, Tetsuro Morimoto, Imari Sato, Yasuhiro Mukaigawa, Takao Tomono, Katsushi Ikeuchi:
Reconstructing Shapes and Appearances of Thin Film Objects Using RGB Images. 3774-3782
Tomas F. Yago Vicente, Minh Hoai, Dimitris Samaras:
Noisy Label Recovery for Shadow Detection in Unfamiliar Domains. 3783-3792
Oral & Spotlight Session 3-2A
O3-2A: Video Understanding
Oscar Koller, Hermann Ney, Richard Bowden:
Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled. 3793-3802
Edward Johns, Stefan Leutenegger, Andrew J. Davison:
Pairwise Decomposition of Image Sequences for Active Multi-view Recognition. 3813-3822
Yixin Zhu, Chenfanfu Jiang, Yibiao Zhao, Demetri Terzopoulos, Song-Chun Zhu:
Inferring Forces and Learning Human Utilities from Videos. 3823-3833
Hyun Soo Park, Jyh-Jing Hwang, Jianbo Shi:
Force from Motion: Decoding Physical Sensation in a First Person Video. 3834-3842
S3-2A: Video Analysis 2
Pan Ji, Hongdong Li, Mathieu Salzmann, Yiran Zhong:
Robust Multi-Body Feature Tracker: A Segmentation-Free Approach. 3843-3851
Dinesh Jayaraman, Kristen Grauman:
Slow and Steady Feature Analysis: Higher Order Temporal Coherence in Video. 3852-3861
Chun-Hao Huang, Benjamin Allain, Jean-Sébastien Franco, Nassir Navab, Slobodan Ilic, Edmond Boyer:
Volumetric 3D Tracking by Detection. 3862-3870
Shoou-I Yu, Deyu Meng, Wangmeng Zuo, Alexander G. Hauptmann:
The Solution Path Algorithm for Identity-Aware Multi-object Tracking. 3871-3879
Tianzhu Zhang, Adel Bibi, Bernard Ghanem:
In Defense of Sparse Tracking: Circulant Sparse Tracker. 3880-3888
Laura Sevilla-Lara, Deqing Sun, Varun Jampani, Michael J. Black:
Optical Flow with Semantic Segmentation and Localized Layers. 3889-3898
Oral & Spotlight Session 3-2B
O3-2B: Grouping and Optimization Methods
Marc Teva Law, Yaoliang Yu, Matthieu Cord, Eric P. Xing:
Closed-Form Training of Mahalanobis Distance for Supervised Clustering. 3909-3917
Chong You, Daniel P. Robinson, René Vidal:
Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit. 3918-3927
Chong You, Chun-Guang Li, Daniel P. Robinson, René Vidal:
Oracle Based Active Set Algorithm for Scalable Elastic Net Subspace Clustering. 3928-3937
Wen-bing Huang, Fuchun Sun, Le-le Cao, Deli Zhao, Huaping Liu, Mehrtash Harandi:
Sparse Coding and Dictionary Learning with Linear Dynamical Systems. 3938-3947
Thomas Möllenhoff, Emanuel Laude, Michael Möller, Jan Lellmann, Daniel Cremers:
Sublabel-Accurate Relaxation of Nonconvex Energies. 3948-3956
S3-2B: Statistical Methods and Transfer Learning
Viktoriia Sharmanska, Novi Quadrianto:
Learning from the Mistakes of Others: Matching Errors in Cross-Dataset Learning. 3967-3975
Rudrasis Chakraborty, Dohyung Seo, Baba C. Vemuri:
An Efficient Exact-PGA Algorithm for Constant Curvature Manifolds. 3976-3984
Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert:
Cross-Stitch Networks for Multi-task Learning. 3994-4003
Hyun Oh Song, Yu Xiang, Stefanie Jegelka, Silvio Savarese:
Deep Metric Learning via Lifted Structured Feature Embedding. 4004-4012
Poster Session P3-2
Ang Li, Dapeng Chen, Yuanliu Liu, Zejian Yuan:
Coordinating Multiple Disparity Proposals for Stereo Computation. 4022-4030
Chi Zhang, Zhiwei Li, Rui Cai, Hongyang Chao, Yong Rui:
Joint Multiview Segmentation and Localization of RGB-D Images Using Depth-Induced Silhouette Consistency. 4031-4039
Nikolaus Mayer, Eddy Ilg, Philip Häusser, Philipp Fischer, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox:
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation. 4040-4048
Wei Feng, Fei-Peng Tian, Qian Zhang, Jizhou Sun:
6D Dynamic Camera Relocalization from Single Reference Image. 4049-4057
Rene Ranftl, Vibhav Vineet, Qifeng Chen, Vladlen Koltun:
Dense Monocular Depth Estimation in Complex Dynamic Scenes. 4058-4066
Christian Mostegel, Markus Rumpler, Friedrich Fraundorfer, Horst Bischof:
Using Self-Contradiction to Learn Confidence Measures in Stereo Vision. 4067-4076
Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla:
Understanding Real World Indoor Scenes with Synthetic Data. 4077-4085
Hae-Gon Jeon, Joon-Young Lee, Sunghoon Im, Hyowon Ha, In-So Kweon:
Stereo Matching with Color and Monochrome Cameras in Low-Light Conditions. 4086-4094
Gil Ben-Artzi, Yoni Kasten, Shmuel Peleg, Michael Werman:
Camera Calibration from Dynamic Silhouettes Using Motion Barcodes. 4095-4103
Wencheng Wang, Tianhao Gao:
Constructing Canonical Regions for Fast and Effective View Selection. 4114-4122
Yuchao Dai, Hongdong Li, Laurent Kneip:
Rolling Shutter Camera Relative Pose: Generalized Epipolar Geometry. 4132-4140
Ayan Sinha, Chiho Choi, Karthik Ramani:
DeepHand: Robust Hand Pose Estimation by Completing a Matrix Imputed with Deep Features. 4150-4158
Zheng Zhang, Chengquan Zhang, Wei Shen, Cong Yao, Wenyu Liu, Xiang Bai:
Multi-oriented Text Detection with Fully Convolutional Networks. 4159-4167
Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai:
Robust Scene Text Recognition with Automatic Rectification. 4168-4176
George Trigeorgis, Patrick Snape, Mihalis A. Nicolaou, Epameinondas Antonakos, Stefanos Zafeiriou:
Mnemonic Descent Method: A Recurrent Process Applied for End-to-End Face Alignment. 4177-4187
Amin Jourabloo, Xiaoming Liu:
Large-Pose Face Alignment via CNN-Based Dense 3D Model Fitting. 4188-4196
Joseph Roth, Yiying Tong, Xiaoming Liu:
Adaptive 3D Face Reconstruction from Unconstrained Photo Collections. 4197-4206
Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, Jan Kautz:
Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks. 4207-4215
Hyung Jin Chang, Tobias Fischer, Maxime Petit, Martina Zambelli, Yiannis Demiris:
Kinematic Structure Correspondences via Hypergraph Matching. 4216-4225
Binod Bhattarai, Gaurav Sharma, Frédéric Jurie:
CP-mtML: Coupled Projection Multi-Task Metric Learning for Large Scale Face Retrieval. 4226-4235
Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato:
Joint Recovery of Dense Correspondence and Cosegmentation in Two Images. 4246-4255
Yuanlu Xu, Xiaobai Liu, Yang Liu, Song-Chun Zhu:
Multi-view People Tracking via Hierarchical Trajectory Composition. 4256-4265
Jifeng Ning, Jimei Yang, Shaojie Jiang, Lei Zhang, Ming-Hsuan Yang:
Object Tracking via Dual Linear Structured SVM and Explicit Feature Map. 4266-4274
Hyeonseob Nam, Bohyung Han:
Learning Multi-domain Convolutional Neural Networks for Visual Tracking. 4293-4302
Yuankai Qi, Shengping Zhang, Lei Qin, Hongxun Yao, Qingming Huang, Jongwoo Lim, Ming-Hsuan Yang:
Hedged Deep Tracking. 4303-4311
Si Liu, Tianzhu Zhang, Xiaochun Cao, Changsheng Xu:
Structural Correlation Filter for Robust Visual Tracking. 4312-4320
Jongwon Choi, Hyung Jin Chang, Jiyeoup Jeong, Yiannis Demiris, Jin Young Choi:
Visual Tracking Using Attention-Modulated Disintegration and Integration. 4321-4330
Vikas Dhiman, Quoc-Huy Tran, Jason J. Corso, Manmohan Chandraker:
A Continuous Occlusion Model for Road Scene Understanding. 4331-4339
Adrien Gaidon, Qiao Wang, Yohann Cabon, Eleonora Vig:
Virtual Worlds as Proxy for Multi-object Tracking Analysis. 4340-4349
Keisuke Midorikawa, Toshihiko Yamasaki, Kiyoharu Aizawa:
Uncalibrated Photometric Stereo by Stepwise Optimization Using Principal Components of Isotropic BRDFs. 4350-4358
Yvain Quéau, Roberto Mecca, Jean-Denis Durou:
Unbiased Photometric Stereo for Colored Surfaces: A Variational Approach. 4359-4368
Yiming Qian, Minglun Gong, Yee-Hong Yang:
3D Reconstruction of Transparent Objects with Position-Normal Consistency. 4369-4377
Roy Or-El, Rom Hershkovitz, Aaron Wetzler, Guy Rosman, Alfred M. Bruckstein, Ron Kimmel:
Real-Time Depth Refinement for Specular Objects. 4378-4386
Kenichiro Tanaka, Yasuhiro Mukaigawa, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi:
Recovering Transparent Shape from Time-of-Flight Distortion. 4387-4395
Nianyi Li, Haiting Lin, Bilin Sun, Mingyuan Zhou, Jingyi Yu:
Rotational Crossed-Slit Light Fields. 4405-4413
Fabrizio Natola, Valsamis Ntouskos, Fiora Pirri, Marta Sanzari:
Single Image Object Modeling Based on BRDF and r-Surfaces Learning. 4414-4423
Monami Banerjee, Rudrasis Chakraborty, Edward Ofori, Michael S. Okun, David E. Vaillancourt, Baba C. Vemuri:
A Nonlinear Regression Technique for Manifold Valued Data with Applications to Medical Image Analysis. 4424-4432
Qilong Wang, Peihua Li, Wangmeng Zuo, Lei Zhang:
RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian with Application to Material Recognition. 4433-4441
Nikolaos Karianakis, Jingming Dong, Stefano Soatto:
An Empirical Evaluation of Current Convolutional Architectures' Ability to Manage Nuisance Location and Scale Variability. 4442-4451
Varun Jampani, Martin Kiefel, Peter V. Gehler:
Learning Sparse High Dimensional Filters: Image Filtering, Dense CRFs and Bilateral Neural Networks. 4452-4461
Fujiao Ju, Yanfeng Sun, Junbin Gao, Simeng Liu, Yongli Hu, Baocai Yin:
Mixture of Bilateral-Projection Two-Dimensional Probabilistic Principal Component Analysis. 4462-4470
Raviteja Vemulapalli, Rama Chellappa:
Rolling Rotations for Recognizing Human Actions from 3D Skeletal Data. 4471-4479
Stephan Zheng, Yang Song, Thomas Leung, Ian J. Goodfellow:
Improving the Robustness of Deep Neural Networks via Stability Training. 4480-4488
Xikang Zhang, Yin Wang, Mengran Gou, Mario Sznaier, Octavia I. Camps:
Efficient Temporal Sequence Comparison and Classification Using Gram Matrix Embeddings on a Riemannian Manifold. 4498-4507
Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Efstratios Gavves, Tinne Tuytelaars:
Deep Reflectance Maps. 4508-4516
Amir M. Rahimi, Raphael Ruschell, B. S. Manjunath:
UAV Sensor Fusion with Latent-Dynamic Conditional Random Fields in Coronal Plane Estimation. 4527-4534
Elena Stumm, Christopher Mei, Simon Lacroix, Juan I. Nieto, Marco Hutter, Roland Siegwart:
Robust Visual Place Recognition with Graph Kernels. 4535-4544
Liang-Chieh Chen, Jonathan T. Barron, George Papandreou, Kevin Murphy, Alan L. Yuille:
Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform. 4545-4554
Oral & Spotlight Session 4-1A
O4-1A: Image & Video Captioning and Descriptions
Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell:
Natural Language Object Retrieval. 4555-4564
Justin Johnson, Andrej Karpathy, Li Fei-Fei:
DenseCap: Fully Convolutional Localization Networks for Dense Captioning. 4565-4574
Jean-Baptiste Alayrac, Piotr Bojanowski, Nishant Agrawal, Josef Sivic, Ivan Laptev, Simon Lacoste-Julien:
Unsupervised Learning from Narrated Instruction Videos. 4575-4583
Haonan Yu, Jiang Wang, Zhiheng Huang, Yi Yang, Wei Xu:
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks. 4584-4593
Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui:
Jointly Modeling Embedding and Translation to Bridge Video and Language. 4594-4602
S4-1A: High Level Semantics
Arjun Chandrasekaran, Ashwin K. Vijayakumar, Stanislaw Antol, Mohit Bansal, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh:
We are Humor Beings: Understanding and Predicting Visual Humor. 4603-4612
Kevin J. Shih, Saurabh Singh, Derek Hoiem:
Where to Look: Focus Regions for Visual Question Answering. 4613-4621
Qi Wu, Peng Wang, Chunhua Shen, Anthony R. Dick, Anton van den Hengel:
Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources. 4622-4630
Makarand Tapaswi, Yukun Zhu, Rainer Stiefelhagen, Antonio Torralba, Raquel Urtasun, Sanja Fidler:
MovieQA: Understanding Stories in Movies through Question-Answering. 4631-4640
Yuncheng Li, Yale Song, Liangliang Cao, Joel R. Tetreault, Larry Goldberg, Alejandro Jaimes, Jiebo Luo:
TGIF: A New Dataset and Benchmark on Animated GIF Description. 4641-4650
Quanzeng You, Hailin Jin, Zhaowen Wang, Chen Fang, Jiebo Luo:
Image Captioning with Semantic Attention. 4651-4659
Oral & Spotlight Session 4-1B
O4-1B: Non-rigid Reconstruction and Motion Analysis
Armin Mustafa, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton:
Temporally Coherent 4D Reconstruction of Complex Dynamic Scenes. 4660-4669
Shaifali Parashar, Daniel Pizarro, Adrien Bartoli:
Isometric Non-rigid Shape-from-Motion in Linear Time. 4679-4687
Jianhui Chen, Hoang Minh Le, Peter Carr, Yisong Yue, James J. Little:
Learning Online Smooth Predictors for Realtime Camera Planning Using Recurrent Decision Trees. 4688-4696
Qifeng Chen, Vladlen Koltun:
Full Flow: Optical Flow Estimation By Global Optimization over Regular Grids. 4706-4714
S4-1B: Human Pose Estimation
Xiao Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Structured Feature Learning for Pose Estimation. 4715-4723
João Carreira, Pulkit Agrawal, Katerina Fragkiadaki, Jitendra Malik:
Human Pose Estimation with Iterative Error Feedback. 4733-4742
Poster Session P4-1
Thibaut Durand, Nicolas Thome, Matthieu Cord:
WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks. 4743-4752
Lingxi Xie, Jingdong Wang, Zhen Wei, Meng Wang, Qi Tian:
DisturbLabel: Regularizing CNN on the Loss Layer. 4753-4762
Leslie N. Smith, Emily M. Hand, Timothy Doster:
Gradual DropIn of Layers to Train Very Deep Neural Networks. 4763-4771
Zhiwei Deng, Arash Vahdat, Hexiang Hu, Greg Mori:
Structure Inference Machines: Recurrent Neural Networks for Analyzing Relations in Group Activity Recognition. 4772-4781
Zhangyang Wang, Shiyu Chang, Yingzhen Yang, Ding Liu, Thomas S. Huang:
Studying Very Low Resolution Recognition Using Deep Networks. 4792-4800
Raviteja Vemulapalli, Oncel Tuzel, Ming-Yu Liu:
Deep Gaussian Conditional Random Field Network: A Model-Based Deep Network for Discriminative Denoising. 4801-4809
Yufei Wang, Zhe Lin, Xiaohui Shen, Radomír Mech, Gavin S. P. Miller, Garrison W. Cottrell:
Event-Specific Image Importance. 4810-4819
Jiaxiang Wu, Cong Leng, Yuhang Wang, Qinghao Hu, Jian Cheng:
Quantized Convolutional Neural Networks for Mobile Devices. 4820-4828
Alexey Dosovitskiy, Thomas Brox:
Inverting Visual Representations with Convolutional Networks. 4829-4837
Iacopo Masi, Stephen Rawls, Gérard G. Medioni, Prem Natarajan:
Pose-Aware Face Recognition in the Wild. 4838-4846
Meina Kan, Shiguang Shan, Xilin Chen:
Multi-view Deep Network for Cross-View Classification. 4847-4855
Yi Sun, Xiaogang Wang, Xiaoou Tang:
Sparsifying Neural Network Connections for Face Recognition. 4856-4864
Qingxiang Feng, Yicong Zhou, Rushi Lan:
Pairwise Linear Regression Classification for Image Set Retrieval. 4865-4872
Ira Kemelmacher-Shlizerman, Steven M. Seitz, Daniel Miller, Evan Brossard:
The MegaFace Benchmark: 1 Million Faces for Recognition at Scale. 4873-4882
Ognjen Arandjelovic:
Learnt Quasi-Transitive Similarity for Retrieval from Large Collections of Faces. 4883-4892
Yandong Wen, Zhifeng Li, Yu Qiao:
Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition. 4893-4901
Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Maja Pantic:
Copula Ordinal Regression for Joint Estimation of Facial Action Unit Intensity. 4902-4910
Timo Bolkart, Stefanie Wuhrer:
A Robust Multilinear Model Learning Framework for 3D Faces. 4911-4919
Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua:
Ordinal Regression with Multiple Output CNN for Age Estimation. 4920-4928
Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter V. Gehler, Bernt Schiele:
DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation. 4929-4937
Suha Kwak, Minsu Cho, Ivan Laptev:
Thin-Slicing for Pose: Learning to Understand Pose without Explicit Pose Estimation. 4938-4947
Hashim Yasin, Umar Iqbal, Björn Krüger, Andreas Weber, Juergen Gall:
A Dual-Source Approach for 3D Pose Estimation from a Single Image. 4948-4956
Markus Oberweger, Gernot Riegler, Paul Wohlhart, Vincent Lepetit:
Efficiently Creating 3D Training Data for Fine Hand Pose Estimation. 4957-4965
Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Konstantinos G. Derpanis, Kostas Daniilidis:
Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video. 4966-4975
Satwik Kottur, Ramakrishna Vedantam, José M. F. Moura, Devi Parikh:
VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes. 4985-4994
Yuke Zhu, Oliver Groth, Michael S. Bernstein, Li Fei-Fei:
Visual7W: Grounded Question Answering in Images. 4995-5004
Liwei Wang, Yin Li, Svetlana Lazebnik:
Learning Deep Structure-Preserving Image-Text Embeddings. 5005-5013
Peng Zhang, Yash Goyal, Douglas Summers-Stay, Dhruv Batra, Devi Parikh:
Yin and Yang: Balancing and Answering Binary Visual Questions. 5014-5022
Song Bai, Xiang Bai, Zhichao Zhou, Zhaoxiang Zhang, Longin Jan Latecki:
GIFT: A Real-Time and Scalable 3D Shape Search Engine. 5023-5032
Chao Zhang, William A. P. Smith, Arnaud Dessein, Nick Pears, Hang Dai:
Functional Faces: Groupwise Dense Correspondence Using Functional Maps. 5033-5041
Girum G. Demisse, Djamila Aouada, Björn E. Ottersten:
Similarity Metric for Curved Shapes in Euclidean Space. 5042-5050
Xinchu Shi, Haibin Ling, Weiming Hu, Junliang Xing, Yanning Zhang:
Tensor Power Iteration for Multi-graph Matching. 5062-5070
Yongxin Yang, Timothy M. Hospedales:
Multivariate Regression on the Grassmannian for Predicting Novel Domains. 5071-5080
Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang:
Learning Cross-Domain Landmarks for Heterogeneous Domain Adaptation. 5081-5090
Diego Marcos Gonzalez, Raffay Hamid, Devis Tuia:
Geospatial Correspondences for Multimodal Registration. 5091-5100
George Trigeorgis, Mihalis A. Nicolaou, Stefanos Zafeiriou, Björn W. Schuller:
Deep Canonical Time Warping. 5110-5118
Xianglong Liu, Xinjie Fan, Cheng Deng, Zhujin Li, Hao Su, Dacheng Tao:
Multilinear Hyperplane Hashing. 5119-5127
Olivier Canévet, François Fleuret:
Large Scale Hard Sample Mining with Monte Carlo Tree Search. 5128-5137
Jianwei Yang, Devi Parikh, Dhruv Batra:
Joint Unsupervised Learning of Deep Representations and Image Clusters. 5147-5156
Ming Yin, Yi Guo, Junbin Gao, Zhaoshui He, Shengli Xie:
Kernel Sparse Subspace Clustering on Symmetric Positive Definite Manifolds. 5157-5164
Chen Huang, Chen Change Loy, Xiaoou Tang:
Unsupervised Learning of Discriminative Attributes and Visual Representations. 5175-5184
Ha Quang Minh, Marco San-Biagio, Loris Bazzani, Vittorio Murino:
Approximate Log-Hilbert-Schmidt Distances between Covariance Operators for Image Classification. 5195-5203
Yongfang Cheng, Yin Wang, Mario Sznaier, Octavia I. Camps:
Subspace Clustering with Priors via Sparse Quadratically Constrained Quadratic Programming. 5204-5212
Yusuke Mukuta, Tatsuya Harada:
Kernel Approximation via Empirical Orthogonal Decomposition for Unsupervised Feature Learning. 5222-5230
Agata Mosinska-Domanska, Raphael Sznitman, Przemyslaw Glowacki, Pascal Fua:
Active Learning for Delineation of Curvilinear Structures. 5231-5239
Xavier Alameda-Pineda, Elisa Ricci, Yan Yan, Nicu Sebe:
Recognizing Emotions from Abstract Paintings Using Non-Linear Matrix Completion. 5240-5248
Canyi Lu, Jiashi Feng, Yudong Chen, Wei Liu, Zhouchen Lin, Shuicheng Yan:
Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization. 5249-5257
Soheil Kolouri, Yang Zou, Gustavo K. Rohde:
Sliced Wasserstein Kernels for Probability Distributions. 5258-5267
Xian Wei, Hao Shen, Martin Kleinsteuber:
Trace Quotient Meets Sparsity: A Method for Learning Low Dimensional Image Representations. 5268-5277
Hisham Cholakkal, Jubin Johnson, Deepu Rajan:
Backtracking ScSPM Image Classifier for Weakly Supervised Top-Down Saliency. 5278-5287
Oral & Spotlight Session 4-2A
O4-2A: Learning and CNN Architectures
Relja Arandjelovic, Petr Gronát, Akihiko Torii, Tomás Pajdla, Josef Sivic:
NetVLAD: CNN Architecture for Weakly Supervised Place Recognition. 5297-5307
Ashesh Jain, Amir Roshan Zamir, Silvio Savarese, Ashutosh Saxena:
Structural-RNN: Deep Learning on Spatio-Temporal Graphs. 5308-5317
Yong-Deok Kim, Taewoong Jang, Bohyung Han, Seungjin Choi:
Learning to Select Pre-Trained Deep Representations with Bayesian Evidence Framework. 5318-5326
Soravit Changpinyo, Wei-Lun Chao, Boqing Gong, Fei Sha:
Synthesized Classifiers for Zero-Shot Learning. 5327-5336
S4-2A: Learning and Optimization
Zhuwen Li, Shuoguang Yang, Loong-Fah Cheong, Kim-Chuan Toh:
Simultaneous Clustering and Model Selection for Tensor Affinities. 5347-5355
Jinglin Xu, Junwei Han, Feiping Nie:
Discriminatively Embedded K-Means for Multi-view Clustering. 5356-5364
Ishant Shanu, Chetan Arora, Parag Singla:
Min Norm Point Algorithm for Higher Order MRF-MAP Inference. 5365-5374
Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang:
Learning Deep Representation for Imbalanced Classification. 5375-5384
Vijay Kumar B. G, Gustavo Carneiro, Ian D. Reid:
Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions. 5385-5394
Piotr Koniusz, Anoop Cherian:
Sparse Coding for Third-Order Super-Symmetric Tensor Descriptors with Application to Texture Recognition. 5395-5403
Jen-Hao Rick Chang, Aswin C. Sankaranarayanan, B. V. K. Vijaya Kumar:
Random Features for Sparse Signal Classification. 5404-5412
Oral & Spotlight Session 4-2B
O4-2B: 3D Shape Reconstruction
Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, In-So Kweon:
High-Quality Depth from Uncalibrated Small Motion Clip. 5413-5421
Michael Firman, Oisin Mac Aodha, Simon J. Julier, Gabriel J. Brostow:
Structured Prediction of Unobserved Voxels from a Single Depth Image. 5431-5440
Sean Ryan Fanello, Christoph Rhemann, Vladimir Tankovich, Adarsh Kowdle, Sergio Orts-Escolano, David Kim, Shahram Izadi:
HyperDepth: Learning Depth from Structured Light without Matching. 5441-5450
Ting-Chun Wang, Manmohan Chandraker, Alexei A. Efros, Ravi Ramamoorthi:
SVBRDF-Invariant Shape and Reflectance Estimation from Light-Field Cameras. 5451-5459
S4-2B: 3D Reconstruction
Nikolay Savinov, Christian Häne, Lubor Ladicky, Marc Pollefeys:
Semantic 3D Reconstruction with Continuous Regularization and Ray Potentials Using a Visibility Consistency Constraint. 5460-5469
Carolina Raposo, Joao P. Barreto:
Theory and Practice of Structure-From-Motion Using Affine Correspondences. 5470-5478
Silvano Galliani, Konrad Schindler:
Just Look at the Image: Viewpoint-Specific Surface Normal Prediction for Improved Multi-View Reconstruction. 5479-5487
Filip Radenovic, Johannes L. Schönberger, Dinghuang Ji, Jan-Michael Frahm, Ondrej Chum, Jiri Matas:
From Dusk Till Dawn: Modeling in the Dark. 5488-5496
Benjamin Eckart, Kihwan Kim, Alejandro J. Troccoli, Alonzo Kelly, Jan Kautz:
Accelerated Generative Models for 3D Point Cloud Data. 5497-5505
John Flynn, Ivan Neulander, James Philbin, Noah Snavely:
Deep Stereo: Learning to Predict New Views from the World's Imagery. 5515-5524
Oral & Spotlight Session 4-3A
O4-3A: Face, Gesture, & Situation Recognition: Algorithms and Datasets
Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang:
WIDER FACE: A Face Detection Benchmark. 5525-5533
Mark Yatskar, Luke S. Zettlemoyer, Ali Farhadi:
Situation Recognition: Visual Semantic Role Labeling for Image Understanding. 5534-5542
S4-3A: People and Faces
James Booth, Anastasios Roussos, Stefanos Zafeiriou, Allan Ponniah, David Dunaway:
A 3D Morphable Model Learnt from 10,000 Faces. 5543-5552
Rasmus Rothe, Radu Timofte, Luc Van Gool:
Some Like It Hot - Visual Guidance for Preference Prediction. 5553-5561
Carlos Fabian Benitez-Quiroz, Ramprakash Srinivasan, Aleix M. Martínez:
EmotioNet: An Accurate, Real-Time Algorithm for the Automatic Annotation of a Million Facial Expressions in the Wild. 5562-5570
Shuxin Ouyang, Timothy M. Hospedales, Yi-Zhe Song, Xueming Li:
ForgetMeNot: Memory-Aware Forensic Facial Sketch Matching. 5571-5579
Karan Sikka, Gaurav Sharma, Marian Stewart Bartlett:
LOMo: Latent Ordinal Model for Facial Analysis in Videos. 5580-5589
Dipan K. Pal, Felix Juefei-Xu, Marios Savvides:
Discriminative Invariant Kernel Features: A Bells-and-Whistles-Free Approach to Unsupervised Face Recognition and Pose Estimation. 5590-5599
Peiyun Hu, Deva Ramanan:
Bottom-Up and Top-Down Reasoning with Hierarchical Rectified Gaussians. 5600-5609
David Joseph Tan, Thomas J. Cashman, Jonathan Taylor, Andrew W. Fitzgibbon, Daniel Tarlow, Sameh Khamis, Shahram Izadi, Jamie Shotton:
Fits Like a Glove: Rapid and Reliable Hand Shape Personalization. 5610-5619
Jing Shao, Chen Change Loy, Kai Kang, Xiaogang Wang:
Slicing Convolutional Neural Network for Crowd Video Understanding. 5620-5628
Spotlight Session 4-3B
S4-3B: 3D, Stereo, Matching, and Saliency Estimation
Florian Bernard, Peter Gemmar, Frank Hertel, Jorge M. Gonçalves, Johan Thunberg:
Linear Shape Deformation Models with Local Support Using Graph-Based Structured Matrix Factorisation. 5629-5638
Jayakorn Vongkulbhisal, Ricardo Silveira Cabral, Fernando De la Torre, João Paulo Costeira:
Motion from Structure (MfS): Searching for 3D Objects in Cluttered Point Trajectories. 5639-5647
Charles Ruizhongtai Qi, Hao Su, Matthias Nießner, Angela Dai, Mengyuan Yan, Leonidas J. Guibas:
Volumetric and Multi-view CNNs for Object Classification on 3D Data. 5648-5656
Menghua Zhai, Scott Workman, Nathan Jacobs:
Detecting Vanishing Points Using Global Image Context in a Non-Manhattan World. 5657-5665
Chunyuan Li, Andrew Stevens, Changyou Chen, Yunchen Pu, Zhe Gan, Lawrence Carin:
Learning Weight Uncertainty with Stochastic Gradient MCMC for Shape Classification. 5666-5675
Duc Thanh Nguyen, Binh-Son Hua, Minh-Khoi Tran, Quang-Hieu Pham, Sai-Kit Yeung:
A Field Model for Repairing 3D Shapes. 5676-5684
Wenjie Luo, Alexander G. Schwing, Raquel Urtasun:
Efficient Deep Learning for Stereo Matching. 5695-5703
Yinlin Hu, Rui Song, Yunsong Li:
Efficient Coarse-to-Fine Patch Match for Large Displacement Optical Flow. 5704-5712
Shengfeng He, Rynson W. H. Lau:
Exemplar-Driven Top-Down Saliency Detection via Deep Association. 5723-5732
Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian L. Price, Radomír Mech:
Unconstrained Salient Object Detection via Proposal Subset Optimization. 5733-5742
Sina Honari, Jason Yosinski, Pascal Vincent, Christopher J. Pal:
Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation. 5743-5752
Saumya Jetley, Naila Murray, Eleonora Vig:
End-to-End Saliency Mapping via Probability Distribution Prediction. 5753-5761
Poster Session P4-2
Shaojing Fan, Tian-Tsong Ng, Bryan L. Koenig, Ming Jiang, Qi Zhao:
A Paradigm for Building Generalized Models of Human Image Perception through Data Fusion. 5762-5771
Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui:
Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines. 5772-5780
Srinivas S. S. Kruthiventi, Vennela Gudisa, Jaley H. Dholakiya, R. Venkatesh Babu:
Saliency Unified: A Deep Architecture for Simultaneous Eye Fixation Prediction and Salient Object Segmentation. 5781-5790
Yuxiang Zhou, Epameinondas Antonakos, Joan Alabort-i-Medina, Anastasios Roussos, Stefanos Zafeiriou:
Estimating Correspondences of Deformable Objects "In-the-Wild". 5791-5801
Vladislav Golyanik, Sk Aziz Ali, Didier Stricker:
Gravitational Approach for Point Set Registration. 5802-5810
Gang Wang, Zhicheng Wang, Yufei Chen, Qiangqiang Zhou, Weidong Zhao:
Context-Aware Gaussian Fields for Non-rigid Point Set Registration. 5811-5819
Magnus Oskarsson, Kenneth Batstone, Kalle Åström:
Trust No One: Low Rank Matrix Factorization Using Hierarchical RANSAC. 5820-5829
Chen Wang, Ramin Zabih:
Relaxation-Based Preprocessing Techniques for Markov Random Field Inference. 5830-5838
Yuhui Quan, Yong Xu, Yuping Sun, Yan Huang, Hui Ji:
Sparse Coding for Classification via Discrimination Ensemble. 5839-5847
Pierre Baqué, Timur M. Bagautdinov, François Fleuret, Pascal Fua:
Principled Parallel Mean-Field Inference for Discrete Random Fields. 5848-5857
Tat-Jun Chin, Yang Heng Kee, Anders P. Eriksson, Frank Neumann:
Guaranteed Outlier Removal with Mixed Integer Linear Programs. 5858-5866
Thalaiyasingam Ajanthan, Richard I. Hartley, Mathieu Salzmann:
Memory Efficient Max Flow for Multi-label Submodular MRFs. 5867-5876
Mingkui Tan, Shijie Xiao, Junbin Gao, Dong Xu, Anton van den Hengel, Qinfeng Shi:
Proximal Riemannian Pursuit for Large-Scale Trace-Norm Minimization. 5877-5886

Sohil Shah, Tom Goldstein, Christoph Studer:
Estimating Sparse Signals with Smooth Support via Convex Programming and Block Sparsity. 5906-5915
Na Qi, Yunhui Shi, Xiaoyan Sun, Baocai Yin:
TenSR: Multi-dimensional Tensor Sparse Representation. 5916-5925
Florian Jug, Evgeny Levinkov, Corinna Blasse, Eugene W. Myers, Bjoern Andres:
Moral Lineage Tracing. 5926-5935
Behrooz Nasihatkon, Frida Fejne, Fredrik Kahl:
Globally Optimal Rigid Intensity Based Registration: A Fast Fourier Domain Approach. 5936-5944
Haichuan Yang, Yijun Huang, Lam Tran, Ji Liu, Shuai Huang:
On Benefits of Selection Diversity via Bilevel Exclusive Sparsity. 5945-5954
Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian D. Reid:
Fast Training of Triplet-Based Deep Binary Embedding Networks. 5955-5964
Aayush Bansal, Bryan C. Russell, Abhinav Gupta:
Marr Revisited: 2D-3D Alignment via Surface Normal Prediction. 5965-5974
Ziad Al-Halah, Makarand Tapaswi, Rainer Stiefelhagen:
Recovering the Missing Link: Predicting Class-Attribute Associations for Unsupervised Zero-Shot Learning. 5975-5984
Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham:
Modality and Component Aware Feature Fusion for RGB-D Scene Classification. 5995-6004
Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li:
PPP: Joint Pointwise and Pairwise Image Label Prediction. 6005-6013
Jan Dirk Wegner, Steve Branson, David Hall, Konrad Schindler, Pietro Perona:
Cataloging Public Objects Using Aerial and Street-Level Images - Urban Trees. 6014-6023
Francisco Massa, Bryan C. Russell, Mathieu Aubry:
Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views. 6024-6033
Ziming Zhang, Venkatesh Saligrama:
Zero-Shot Learning via Joint Latent Similarity Embedding. 6034-6042


